Preserving User Preferences in Document-Category Management: An Ontology-based Evolution Approach

نویسندگان

  • Yen-Hsien Lee
  • Chih-Ping Wei
  • Paul Jen-Hwa Hu
چکیده

Preserving the user’s preference in document-category management is essential because it affects his/her search efficiency, cognitive processing load, and satisfaction. Prior research has investigated automated document category evolution by using lexicon-based documentcategory evolution techniques which take into account the document categories previously created by the user. However, comparing documents at the lexical level cannot solve word mismatch or ambiguity problems effectively. To address such problems inherent to the lexicon-based approach, we propose an ONtology-based Category Evolution (ONCE) technique, which uses an appropriate ontology to support document-category evolution at the conceptual level rather than at the lexical level. Specifically, we develop an Ontology Enrichment (OE) technique for automatic leaning of concept descriptors in the adopted ontology. We empirically evaluate the effectiveness of the proposed ONCE technique, using a lexicon-based document-category evolution technique (i.e., CE2) and the hierarchical agglomerative clustering (HAC) technique for benchmark purposes. According to our empirical results, ONCE appears more effective than CE2 and HAC, and achieves higher clustering recall and precision.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Evolution-based Approach to Preserving User Preferences in Document-Category Management

Document clustering is critical to automated document management, hereby a set of documents are clustered in multiple categories, each containing similar or relevant documents. Most previous research assumes time invariability of document category; i.e., not evolving over time after creation. The adequacy of an existing category understandably may diminish as it includes influxes of new documen...

متن کامل

Automatic Workflow Generation and Modification by Enterprise Ontologies and Documents

This article presents a novel method and development paradigm that proposes a general template for an enterprise information structure and allows for the automatic generation and modification of enterprise workflows. This dynamically integrated workflow development approach utilises a conceptual ontology of domain processes and tasks, enterprise charts, and enterprise entities. It also suggests...

متن کامل

Automatic Workflow Generation and Modification by Enterprise Ontologies and Documents

This article presents a novel method and development paradigm that proposes a general template for an enterprise information structure and allows for the automatic generation and modification of enterprise workflows. This dynamically integrated workflow development approach utilises a conceptual ontology of domain processes and tasks, enterprise charts, and enterprise entities. It also suggests...

متن کامل

Increasing the Accuracy of Recommender Systems Using the Combination of K-Means and Differential Evolution Algorithms

Recommender systems are the systems that try to make recommendations to each user based on performance, personal tastes, user behaviors, and the context that match their personal preferences and help them in the decision-making process. One of the most important subjects regarding these systems is to increase the system accuracy which means how much the recommendations are close to the user int...

متن کامل

A New Ontology-Based Approach for Human Activity Recognition from GPS Data

Mobile technologies have deployed a variety of Internet–based services via location based services. The adoption of these services by users has led to mammoth amounts of trajectory data. To use these services effectively, analysis of these kinds of data across different application domains is required in order to identify the activities that users might need to do in different places. Researche...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007